Feature selection, identifying a subset of variables that are relevant for predicting a response, is an important and challenging component of many methods in statistics and machine learning. Feature selection is especially difficult and computationally intensive when the number of variables approaches or exceeds the number of samples, as is often the case for many genomic datasets. Here, we introduce a new approach, the Bayesian Ising Approximation (BIA), to rapidly calculate posterior probabilities for feature relevance in L2 penalized linear regression. In the regime where the regression problem is strongly regularized by the prior, we show that computing the marginal posterior probabilities for features is equivalent to computing the magnetizations of an Ising model. Using a mean field approximation, we show it is possible to rapidly compute the feature selection path described by the posterior probabilities as a function of the L2 penalty. We present simulations and analytical results illustrating the accuracy of the BIA on some simple regression problems. Finally, we demonstrate the applicability of the BIA to high-dimensional regression by analyzing a gene expression dataset with nearly 30,000 features.
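The mean field step at the heart of the BIA can be sketched as follows. This is a minimal illustration, assuming the external fields h and pairwise couplings J of the equivalent Ising model have already been derived from the data and the L2 penalty (the exact mapping is defined in the paper); the fixed-point solver, its damping parameter, and all names below are illustrative, not the authors' implementation.

```python
import numpy as np

def mean_field_magnetizations(h, J, n_iter=200, damping=0.5, tol=1e-8):
    """Solve the mean-field self-consistency equations
    m_i = tanh(h_i + sum_j J_ij m_j) by damped fixed-point iteration.

    h : (p,) external fields of the Ising model (assumed precomputed)
    J : (p, p) symmetric couplings with zero diagonal (assumed precomputed)
    Returns approximate marginal posterior inclusion probabilities.
    """
    p = h.shape[0]
    m = np.zeros(p)  # start from the zero-magnetization state
    for _ in range(n_iter):
        m_new = np.tanh(h + J @ m)
        # Damping stabilizes the iteration when couplings are strong.
        m_next = damping * m + (1 - damping) * m_new
        if np.max(np.abs(m_next - m)) < tol:
            m = m_next
            break
        m = m_next
    # Map spin magnetizations in [-1, 1] to inclusion probabilities in [0, 1].
    return (1 + m) / 2
```

Under these assumptions, sweeping the L2 penalty, recomputing (h, J) at each value, and re-solving the fixed point would trace out the feature selection path described in the abstract.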